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ABSTRACT 



If one wants to represent the galaxy number density at some point in terms 
of only the mass density at the same point, there appears the stochasticity in 
such a relation, which is referred to as "stochastic bias". The stochasticity is 
there because the galaxy number density is not merely a local function of a mass 
density field, but it is a nonlocal functional, instead. Thus, the phenomenological 



stochasticity of the bias should be accounted for by nonlocal features of galaxy 
q^ \ formation processes. Based on mathematical arguments, we show that there 

are simple relations between biasing and nonlocality on linear scales of density 
*p( fluctuations, and that the stochasticity in Fourier space does not exist on linear 

scales under a certain condition, even if the galaxy formation itself is a complex 
nonlinear and nonlocal precess. The stochasticity in real space, however, arise 
from the scale-dependence of bias parameter, b. As examples, we derive the 
stochastic bias parameters of simple nonlocal models of galaxy formation, i.e., 
the local Lagrangian bias models, the cooperative model, and the peak model. 
We show that the stochasticity in real space is also weak, except on the scales 
of nonlocality of the galaxy formation. Therefore, we do not have to worry too 
much about the stochasticity on linear scales, especially in Fourier space, even 
if we do not know the details of galaxy formation process. 

Subject headings: cosmology: theory - - galaxies: statistics - - large-scale 
structure of universe 



1. INTRODUCTION 

Probing the statistical properties of the large-scale structure of the universe has a 
jreat importance in studying the origin of our universe. Recent galaxy surveys have been 
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revealing the statistical properties of distribution of galaxies on very large scales. The 
number density of galaxies are not necessarily proportional to the density of mass, and this 
ambiguity is known as galaxy biasing problem (Kaiser 1984; Davis et al. 1985; Bardeen et 
al. 1986). Since galaxies of different types have different clustering properties (e.g., Dressier 
1980; Davis & Geller 1976; Giovanelli, Haynes, & Chincarini 1986; Santiago & Strauss 1992; 
Loveday et al. 1996; Hermit et al. 1996; Guzzo et al. 1997), not all types of galaxies can 
simultaneously be unbiased tracers of mass. This ambiguity is undesirable in extracting 
cosmological information from the data of galaxy distribution. 

The simplest model for the galaxy bias is the local, linear bias. In this simple model, the 
number density field of galaxy with a fixed smoothing scale R is assumed to be proportional 
to density field of mass with a same smoothing scale: 

6 e (x;R) = b8 m (x',R), (1.1) 

where 8 g and 8 m are density contrast of galaxy and of mass, respectively, with a fixed 
smoothing length R. This model is viable if (a) 5 g (x; R) is dependent only on S m (x; R) 
and (b) density contrast of mass is sufficiently small. The latter condition (b) is achieved 
by considering large scales on which density fluctuations are small enough so that only 
linear term becomes significant. The linear coefficient b is called as bias parameter. The 
bias parameter is often assumed to be a constant, although it can depend on R in general. 
The effect of nonlinearity should be taken into account when we are interested in nonlinear 
scales. On weakly nonlinear scale, this affect the estimation of higher order statistics (e.g., 
Fry & Gaztanaga 1993). 

Although the latter condition is reasonably considered to be valid if we are interested in 
linear scales, the former condition (a) is not trivial so far. The non-triviality of condition (a) 
leads us to a concept of stochastic bias which is recently argued (Dekel & Lahav 1998; Pen 
1998; Tegmark & Peebles 1998; Tegmark & Blomley 1998; Taruya, Koyama & Soda 1998; 
Taruya & Soda 1998; Blanton et al. 1998). In the stochastic biasing scheme, 8 g (x;R) is 
not supposed to be determined solely by 8 m (x; R), but the scatter in 8 g -8 m relation is taken 
into account. In linear regime in which the density contrast of mass 8 m is small enough, 
and is approximated by random Gaussian field, the two-point statistics fully characterize 
the statistics of the scatter. In literatures, the bias parameter b and the dimensionless cross 
correlation r are used to characterize the linear stochastic biasing scheme: 



b(R) 



(8 S (*;R) 2 ) r(m (5 m (x; R)5 g (x; R)) 
\(8 m (x;R)*y [ ' cr m (R)a g (R) ' l ' ' 



where o~ m = (S 2 ^ 1 ^ 2 and a g = (S 2 ) 1 ^ 2 = ba m are rms density fluctuations of mass and 
galaxies, respectively. 
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The bias parameter b in the equation (|L2) is a generalization of the bias parameter 
in linear deterministic biasing scheme of equation ( |1 . 1| ) . In deterministic case, the cross 
correlation r is always unity. From Schwarz inequality, r cannot exceed unity and r = 1 
means that biasing is deterministic, 5 g oc 8 m . Thus, the deviation from r = 1 measures the 
stochasticity. If the smoothing scale R is large enough so that 5 m and 5 g are considered 
as bivariate Gaussian field, these three parameters b, r and a m contain all the statistical 
information about the stochastic biasing. 

In literatures, these parameters are sometimes considered as free parameters to be 
determined by observation. However, if we could know the process of galaxy formation 
in detail, the parameters b and r would be derived from some fundamental physical 
processes. This is because the bias and its stochasticity come from our ignorance of the 
galaxy formation. Therefore, there should be some theoretical constraint between b and 
r. At first sight, it seems difficult to find any constraint as we do not exactly know the 
process of galaxy formation. This is true especially on small scales where the nonlinear 
and nonlocal characters of galaxy formation plays an important role on the statistics of 
galaxy distributions. This problem is one of the most important issues in astrophysics and 
much numerical and analytical work is needed (e.g., Rees & Ostriker 1977; White & Frenk 
1991; Cen & Ostriker 1992; Mo & White 1996). However, on large scales, such undesirable 
characters can be expected to be small. Scherrer & Weinberg (1998), based on the local 
biasing scheme, showed that the stochasticity actually vanishes on large scales and galaxy 
autocorrelation function behaves exactly as in deterministic biasing scheme. Dekel & Lahav 
(1999) also imply the same property based on a specific simple model. 

In this paper, based on a general nonlocal method, we show that stochasticity in Fourier 
space asymptotically vanishes on linear scales under a certain condition, explicitly deriving 
the relation between the stochastic parameters b, r and the nonlinear, nonlocal functional 
form of galaxy formation. We will show the first coefficient of generalized Wiener- Hermite 
functional, which is defined below, of the nonlocal, nonlinear relations of galaxy formation 
will contribute to the galaxy statistics on large scales, if that coefficient does not vanish. 

In the derivation, we can use the technique developed in Matsubara (1995), in which 
the diagrammatic methods for the calculation of general nonlocal biasing are introduced. 
The diagrammatics are useful especially when the non-Gaussianity and/or higher-order 
correlations are interested in. In this paper, however, we derive the result without 
employing diagrammatics for self-consistency and for the simplicity of the problem. The 
diagrammatics make it easier to generalize the present results to higher-order statistics. 

In §2, we revisit the mathematical methods for nonlinear, and nonlocal bias, which have 
been developed by Matsubara (1995). Then we derive the relation between the stochastic 
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bias parameters and the nonlocality of biasing. In §3, we examine three types of biasing 
schemes, i.e., the local Lagrangian bias models, the cooperative model, and the peak model, 
according to the result of §2. In §4, we discuss the results and present the conclusions. 



2. NONLOCAL BIAS AND STOCHASTICITY 

2.1. Mathematical Formulation of Nonlocal Bias 

In the present paradigm, the distribution of galaxies is determined by initial density 
fluctuations. Whether a galaxy has formed at some place or not should be fully determined 
by the initial fluctuations. In this sense, the galaxy formation is deterministic, although 
it nonlinearly, nonlocally, and possibly chaotically depends on initial density fluctuations. 
In the stochastic biasing scheme, this complex features of nonlinearity and nonlocality of 
the galaxy formation are expressed by phenomenological scatter of local 5 g -5 m relation. 
Thus, in principle, stochasticity can be determined by nonlinear and nonlocal deterministic 
processes of galaxy formation. 

Since both the present mass density field p m and the density field of galaxies p g are 
determined by initial density fluctuations Si, they are expressed by functionals F m and 
F g . In the following, instead of the initial fluctuations Si, we will alternatively use linearly 
extrapolated density fluctuations S^ = DSi, where D is a linear growth rate. The variable <5l 
is simply a linear extrapolation of the evolution of density contrast, regardless of whether 
or not the small scale fluctuations are actually in linear regime. The introduction of the 
linearly extrapolated field is just for convenience and it simply represents the initial density 
fluctuations. In this notation, one can write the relations as 

p m (x) = F m ([S L ],x), p g (x) = F g {[S L ],x), (2.1) 

where we introduce the notation [Si] which means that F m and F g are nonlocal (and, of 
course, nonlinear) functionals of S^. These functionals also depend on the position, x. It is 
sometimes more convenient to use the density contrast of p m and p g . 

{F m ([S L },x)} {F g ([S L \,x)} 

One may want to expand these functional by Taylor series of Si,, as in usual perturbative 
approach. However, this approach would be useful only if the nonlinearities on all scales 
were small, and/or galaxy formation process were linear or quasi-linear. In reality, however, 
the galaxy formation process is a highly nonlinear process, and it depends on the nonlinear 
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behavior of mass density field in a very complex way. Thus, we cannot use Taylor expansion 
in our treatment below. Alternatively, we will find below that the expansion of these 
functional by orthogonal functionals, i.e., generalized Wiener- Hermite functionals, is useful 
for our purpose. The n-th order generalized Wiener- Hermite functional TiS n \ which is 
introduced in Matsubara (1995) for the first time, is defined by 

■1 



n {n \ 



3>1, . . . , X Tt 



cxp 
-l) n S r ' 



d 3 xd 3 yb L (x)£ L 1 (x,y)b L (y) 



rr . , ,, , v exp -- / d"xd u yd L {x)S h *{x,y)& L {y) , (2.3) 
od L (xi) ■ ■ ■ bb L (x n ) L 2 J J 

where b/bb^ is a functional derivative with regard to <5l, and £l is a two-point correlation 
function of linear density field, (,l( x , y) = (Sh(x)Si,(y)) . The inverse of correlation function 
£l\ is defined formally by 



J d 3 y$,z\x,y)^{y,z) 



b 3 (x 



(2.4) 



The original Wiener-Hermite functional is the special case when the correlation function 
is identical to Dirac delta function, C,^(x,y) = b 3 (x — y). If b^ has only finite degrees of 
freedom instead of functional of space, the generalized Wiener-Hermite functionals reduce 
to generalized Wiener-Hermite polynomials which are seen in the literature (Appel & 
Feriet 1926). For the application to modern cosmology, the generalized Wiener-Hermite 
functionals are more appropriate than the others. 

We expand the functionals F m and F g by these functionals. It is useful to define the 
following functional which is an infinite dimensional linear combination of 7i^: 



H(n){x u . . . , x n ) = J d 3 Vl ■ ■ ■ d 3 y n ^{xi, yO • • • £h(x n , y n )H {n) (y 1} . . . , y n ), 



(2.5) 



which we also call as generalized Wiener-Hermite functional. The explicit form of the first 
three generalized Wiener-Hermite functionals are 



n 



(0) 



1. 



(2.6) 

H (1) (x) = b L (x), (2.7) 

Ti(2)(xi,x 2 ) = b L (x 1 )b L (x 2 ) -£l(bi ~x 2 ), (2.8) 

H( 3 )(xi,x 2 ,x 3 ) = b^x^b^x^b^x?,) (2.9) 

- [£ L (a?i - x 2 )b L (x 3 ) + £ L (x 2 - x 3 )&l(xi) + Cl(*3 - x 1 )5 h (x 2 )]. (2.10) 



The generalized Wiener-Hermite functionals have the orthogonality relation: 
[H{n){xi, . . . , aj n )W (m) (i/!, . . . , y„ 



S n 



b 3 (x 1 - J/J • • • b 3 (x n - y n ) + sym.^!, ...,y n ) 



(2.11) 
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where sym.(...) denote the terms of symmetric permutation of the preceding term with 
respect to the indices. In this case, there are n! terms in total in the right hand side of 
equation ( |2.11| ). The Gaussian averaging (• • •) denote the averaging by linear density field 
5l by the random Gaussian probability functional P[<5l]: 



(•••)= [d5 L ]---V[8 L ], 



where 



V[5 L ] =Afexp [-- J d i xd 3 y5 L (x)^ l ( X ,y)6 L (y: 
The formal normalization constant M is given by 

N= {|[d<5 L ]exp -- J d 3 xd 3 y5 L (x)^ L 1 (x,y)5 L (y) 



-i 



(2.12) 
(2.13) 

(2.14) 



Because the degrees of freedom is infinite, Af~ x formally diverges, but the proper 
regularization is always possible by discretizing the three dimensional continuum space. 
One can prove the orthogonality equation (|2.11|) , just generalizing the proof of orthogonality 
of simple Hermite polynomial, which is well known. 



We consider the functionals of equation ( |2.5| ) as base functionals for the expansion of 
the functional of mass density field, and of galaxy density field: 



OO 1 - 

SA{r,[5 L }) = ^2~ d 3 x 1 ---d 3 x n K i A n \r-x 1 ,---,r -x n )H {n) {x 1 ,...,x n ) 

n=\ n - J 



(2.15) 



where A = m or g. The reason why n = term is not appeared in the above expression is 
that (5a) = and (H( n )) = for n > [set m = in equation (|2.11|) ]. This expansion is 
complete because the kernel K^ is uniquely given by 



Kt\r 



Xij . . . , T X n j 



n ( - n \x u ...,x n )5 A (r,[5 L ]) 

1 5 n 5 A (r,[5 L ]) \ 
SS L (xi) ■ ■ ■ SS L (x n ) /■ 



(2.16) 
(2.17) 



According to the expansion ( p.!5| ) and the orthogonality relation ( [2.11|) , the two-point 
auto-correlation function of matter £ mm and of galaxies ^ gg and the cross correlation function 
£ mg are given by 



Ub(t) = (S A (ri)S B (r 2 )) 

00 1 r 

4 / d 3 x, ■ ■ ■ d 3 x n d 3 Vl ■ ■ ■ d 3 y n K {n) <~- - ^ ^ n) 



E 

n=l 



d 3 xi ■ ■ ■ d 3 x n d 3 yi ■ ■ ■ d 3 y n K A n '(x l , ..., x n )K'g > (y 1 , ...,y r 
x £ L (r + a;i + y x )£ h (r + x 2 + y 2 )--- Zh{r + x n + y n ), (2.18) 
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where A, B = m, g, and r\ — r 2 = r. Note that this expansion is valid as long as Si, is 
a random Gaussian field. If the initial density field is non-Gaussian, there are additional 
terms in equation ( |2.1S ) which depend on initial higher-order correlation functions [see 



Matsubara (1995) for detail]. 

Both the nonlocal kernels K m and K g do not depend on r. Thus, on large scales with 
respect to the separation r, we can approximate equation (|2.18| ) by only considering lower 
order terms of £,1, provided that the kernel K^ does not have broad profile. 

If the kernel falls off slowly on large scales, we can not truncate the expansion fl2.18| ). 
In the rest of this paper, we assume the lowest order term in the expansion Q2.18| ) actually 



dominates the higher-order terms. Before proceeding to the analysis of the lowest order 
approximation, we consider the cases in which this assumption breaks down. Imagine, for 
example, that ii{r) ~ r~' y , and K^(xi, . . . , x n ) ~ (xi - • • x n )~ SA on large scales. Then the 
n-th order term in the integral in expansion ( |2.18j ) is approximately given by 

d 3 x\ ■ ■ ■ d 3 x n d 3 y\ ■ ■ ■ d 3 y n 



(a?i - - • x n ) s ^(yi ■ ■ ■ y n ) SB \r + x 1 + y x \~< ■ ■ ■ \r + x n + y n \i 
( f d 3 xd 3 y 



x SA y SB \r + x + y\ 



oc (r 6 - SA - SB -~<y . (2.19) 



The last expression is derived by the fact that the integral is the form of convolution and 
the (3D) Fourier transform of k n is proportional to r~ n-3 (Peebles 1980). As seen by this 
expression, 6 — sa — sb — 7 should be negative to ensure the equation ( |2.19| ) actually falls 
off on large scales. For example, if K^ falls off as xj~ 2 x^ 2 , and 7 < 2, then equation (|2.19| ) 
does not fall off anymore. On the other hand, if sa,b > 3, then equation ( |2.19| ) falls off 



rapidly enough so that the higher order terms can be neglected. In the latter case, the 
spatial integration of K^ is finite. It is a natural assumption that the spatial integration 
of K^ is finite. If it is infinite, as seen from equation (|2.15|) , the information of density 



fluctuations of infinitely distant places affect the galaxy formation as much as, or more than 
the fluctuations of nearer places, which is unlikely in reality. We assume sa,b > 3 in the 
rest of this paper. In other words, we assume that Fourier transform of higher-order kernels 
K^(ki, . . . , k n ) are finite in the limit ki,...,k n —>0. 



Note that the expansion ( |2.18D is essentially different from usual perturbative approach 



by Taylor expansion of density contrast itself. Instead, we employ the orthogonal expansion 
for galaxy and mass density fields, and the resulting expression, (|2.18|) , can be interpreted 
as an asymptotic expansion by correlation function, £l. Thus, we only assume the smallness 
of correlation function £l on large scales, whether the density contrast on small scale is 
large or not. 



2.2. A Sample Calculation of Kernels 



Since our argument in this section is extremely formal, it would be instructive to 
calculate an example. Consider a model in which 5 g = 5 m + 5"^ and 5 m is calculated up to 
second order in perturbation theory [e.g., Fry (1984)]: 



S m (r) 



Sh(r) + / d 3 xid 3 x 2 R{r -x x ,r- x 2 )5 L (x 1 )5 L (x 2 ), 



(2.20) 



where 
R(x 1 ,x 2 ) 



d?ki d 3 k 2 
(2tt) 3 (2tt) 3 



5 fci ■ k 2 fki h\ 2 / fei ■ k 2 

7 2k x k 2 \k 2 kj ' 7 { hk 2 



iki-Xi+iK2-X2 In 01 ^ 



and we assume the Einstein-de Sitter Universe for simplicity. Then, according to equation 
( P-17| ), we can calculate the kernels as 



K£\x) = 6 3 (x), 
K^(x 1 ,x 2 ) = 2R(x ll x 2 ) 1 

KW{x) = S 3 (x) + 2Jd 3 yR(x,y)^(y) 



+ 2 5 3 (x) j d 3 yi d 3 y 2 R( yi , y 2 )a(yi - V 2 ), 
Kf\x x , x 2 ) = 25 3 { Xl )5 3 {x 2 ) + 2R( Xl , x 2 ) 

+ AR(x u x 2 ) I d 3 y 1 d 3 y 2 R(y 1 , y 2 )^ L (Vi ~ y 2 ) 
+ 8 J d 3 y 1 d 3 y 2 R(x 1 , y 1 )R(x 2 , y 2 )^{Vi ~ V 2 )- 



(2.22) 
(2.23) 
(2.24) 

(2.25) 
(2.26) 

(2.27) 

(2.28) 



Since R(xi,x 2 ) drops off as \x±\ 3 \x 2 \ 3 on large scales, the discussion at the end of the 



previous subsection suggests that n-th order term in equation ( 2.18 ) actually drops off as 
(£l)™ on large scales. 



2.3. Stochastic Parameters in Linear Regime 

In the following, we are interested in large scales and consider only the lowest order 
approximation of equation ( |2.18| ): 



Ub(v) = f d 3 xd 3 yK { l\x)K < £\y)£ lj (r + x + y) 



(2.29) 



assuming that this term does not vanish and that the higher-order terms are negligible. 
Since this expression has the form of convolution, it becomes just products in Fourier space. 
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From statistical isotropy, the Fourier transform of K^\ denoted as K^\ is a function of 
the absolute value of the wave vector k = \k\: 

P AB (k) = K*£\k)Kg\k)P h (k), (2.30) 

where Ph(k) is the linear power spectrum^. 

The linear-scale power spectrum of mass is simply given by P mm (k) = P^(k). This 
means K$(k) = 1 on linear scales. Thus, denoting bp(k) = K^(k), 

P mm (k) = PM, (2.31) 

P mg {k) = b F {k)P L {k), (2.32) 

P gg (k) = bl(k)P L (k). (2.33) 



These equations are valid as long as K^'(k) ^ 0. If Kfp(k) = 0, equation (|2.30|) vanishes 



and is no longer the lowest order in the expansion of equation (|2.18|) . In such case, we 
should consider the higher-order terms. 

In equations ( |2.31| )-( |2.33| ), by(k) can be identified to the linear bias parameter in 
Fourier space, and is given by, from Fourier transform of equation ( |2.17| ), 

M*) = / «*"»*• (§^) = (%®) , (2.34) 

where 5i,(k) is a Fourier transform of linear density fluctuations, 5l(x). 



These simple equations (|2.31|) -( p.34 ) are the primary results of this paper. These 



equations show, as long as K]p(k) ^ 0, that there are no residual cross correlation in 
Fourier space in linear regime (except for the constant term which comes from the small 
scale behavior of correlation function) and that the bias parameter in Fourier space is b-p(k) 
which is scale-dependent and is related to nonlinearity and nonlocality of galaxy formation 
through equation ( |2.34|) . This means that the Fourier- mode stochasticity which arise from 
the nonlinearity and nonlocality of the galaxy formation vanishes in linear regime. Thus, 
the cross correlation r(k) in Fourier space should approach to unity in large-scale limit, as 
long as the galaxy formation is such that K^'(k) ^ 0. 

The case, K^(k) = can happen in special cases. It can happen, for example, when 
the large-scale linear power is completely erased by some peculiar form of nonlocal biasing, 



-"^If we consider Pab(^) as the true spectrum, there is possibly a constant term in addition to the above 
equation, which comes from the small scale inaccuracy of expression (2.29J) [see Scherrer & Weinberg (1998); 



Dekel & Lahav (1998)]. However, in the following, we consider PAB(k) as a merely mathematical quantity 
which represents just the Fourier transform of £ab(t") of the equation (]2.29[). 
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and mode-mode coupling from nonlinear scales dominates on linear scales. It also can 
happen when the biasing is represented by an even function of 5^, e.g., purely quadratic, 
8 g = 51 — (Si), or quartic, 5 g = 8^ — (Sf), etc. We assume the galaxy formation does not 
have such special form and satisfies the condition K^\k) ^ in the rest of this paper. 

The vanishing stochasticity is an important constraint on large scales. At this point, 
the naive introduction of stochasticity r in linear or quasi-linear regime, as is sometimes 
done in literatures, should be cautious. On large scales, r could not be freely adjusted, but 
would be close to unity. This fact is already noticed in the paper by Dekel & Lahav (1998), 
in which only a specific, simple model is considered. The same conclusion is derived by 
Scherrer & Weinberg (1998) based on local galaxy formation scheme. Our conclusions apply 
not only to local schemes, but also to nonlocal schemes of galaxy formation. The difference 
between local and nonlocal scheme is that local bias generates constant bias factor, while 
nonlocal bias generates scale-dependent one. 

If there is a non- unity value of r in large-scale limit, it means that there is some exotic 
stochasticity which is not relevant to the initial density fluctuations, or that Kjp(k) = 0. 
As all the structures in the universe are supposed to be formed from initial density 
fluctuations, there is no specific reason to introduce such kind of exotic processes, at least 
in the framework of the present standard theory of structure formation in the universe. 



Of course, in nonlinear regime where the approximation of equation ( |2.29|) breaks 



down, the stochasticity in Fourier space arises by mode-mode coupling. Since the dynamics 
of such nonlinear regime is complex enough to trace analytically, the concept of stochastic 
bias is useful especially in this regime. In the nonlinear regime, one should be aware that 
the nonlinearity of galaxy-density relation also dominate and that only parameters b and r 
are not sufficient to characterize the biasing properties (Dekel & Lahav 1998). 

The equations Q2.31| )- (|2.33|) is valid on scales larger than a nonlinear scale 27t//cnl, 
because these equations are derived from lowest order approximation, which assumes 
/c 3 Pl(/c) is small. If the nonlocality of galaxy formation is small and K A (x) is localized 
on some scales smaller than 27r//c g , then its Fourier transform, b-p(k), will be constant for 
k < k g . If k g > /cnl, the bias factor is constant in the validity region of the equations. This 
is equivalent to the purely local galaxy formation. In contrast, if k g < &nl, there appears 
the scale-dependence of bias parameter besides mode-mode coupling. This scale-dependence 
comes from the nonlocality of the galaxy formation. The fc-dependence of b-p(k) thus is not 
negligible on scales below 27r//c g and its behavior can be describable by equation (|2.34|) on 
scales above 27t//cnl- 

Even when there is no stochasticity in Fourier space, it may still appear in real space 
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on linear scales because of the scale-dependence of bias parameter, besides mode-mode 
coupling. Actually, the galaxy-density cross correlation and galaxy correlation for a 
smoothed field with linear smoothing length R are given by 



(S m (x;R)S g (x;R)) 



k 2 dk 



2vr 



2 P mg (k)W 2 (kR) 



k 2 dk 



2n 



2 b F (k)P L (k)W 2 (kR), (2.35) 



(5 s (x;R) 2 ) 



k 2 dk 



P gg {k)W 2 {kR) 



k 2 dk 



b$(k)P L (k)W 2 (kR), 



(2.36) 



2tt 2 ^ y ' v ' J 2tt 2 
where W is a Fourier transform of a smoothing function. This expression and equation 
flOl) explicitly show the scale dependence and stochasticity of the biasing in real space. We 
define the following notation for k-space averaging for an arbitrary function h(k): 

k 2 dk. 



h(R) 



2tt 



-h{k)P lj (k)W 2 {kR) 



k 2 dk 

~2^ 



(2.37) 



P L (k)W 2 (kR) 



Then, equation (|1.2p reduces to, simply, 



b(R) = ^b 2 (R), r(R) 



b F (R) 



b 2 (R) 



(2.38) 



To obtain more insight on this scale-dependence and stochasticity, let us expand the 
bias parameter b F (k) in terms of (k/k g ) 2 , where 27r//c g is the scale of nonlocality of galaxy 
formation as above (terms of odd power of k do not appear for reflection symmetry): 



■,(0) 



,(i) 



# 



b F (k) = b^> + b F l> (k/k g y + -i- (k/k g y + o (k/k g 



(2.39) 



With this expansion, equation ( |2.38j ) is expanded in a straightforward manner, and the 
results are 



,(0) 
F 

+ 



b(R) = b ( p' +3 1 2 b ( p ) {R,k g 

9 7 2 r 



2b 



(0) 



bpbp 



7 2 ) (b? 



yit^rCg 



0{R,k g 



r(R) 



9 



7 2 (1"7 2 ) 



b? 



(R*k g )~ 4 + 0{R,k g 



(2.40) 



(2.41) 



In this expression, spectral parameters 7 and R* (Bardeen et al. 1986, BBKS, hereafter) are 
given by 



_ k 2 {R ) 



k\R) 



R* 



3k 2 {R) 
\ F(i2) 



(2.42) 
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where notation of the equation ( |2.37|) is applied. The parameter 7 is of order unity, and i?* 
is of order R. For example, if the power spectrum has the form of power-law, Pi(k) oc k n , 
and the smoothing function is Gaussian, W(x) oc exp(— x 2 /2), these parameters are given 
by 7 2 = (n + 3)/(n + 5) and R, = (6/(71 + h)) l > 2 R. 



Equations ( |2.40|) and ( p. 41] ) give the expression for stochastic bias parameter from 



nonlocality of galaxy formation. These equations represent the minimal stochasticity 
which is inevitable when bias is scale dependent. We have neglected the mode coupling 
near nonlinear scales. The scale-dependence and stochasticity appears on the scale of 
the nonlocality, 2ir/k g . On scales larger than nonlocality, such scale-dependence and 
stochasticity disappears. Especially, stochasticity rapidly vanishes for the lack of R~ 2 term 
in equation ( |2.41| ). These results do not depend on specific details of galaxy formation 
as long as Kfp(k) 7^ 0, and higher-order terms are negligible. The information of galaxy 
formation involves only through two parameters, b F and b F , up to the order (k g R)~ 2 , or 
three parameters bp , b F and b F up to the order (/c g i?) -4 . These parameters are related to 
galaxy formation through equation ( |2.34| ) which should be calculable if we could know the 



details of galaxy formation process. Otherwise, they can be considered as free parameters to 
be fitted by observation, instead of fitting functions b(R) and r(R), which have the infinite 
degrees of freedom. 



3. SIMPLE MODELS OF NONLOCAL GALAXY FORMATION 

So far the argument is quite formal. In this section, we consider specific models of 
galaxy formation, i.e., the Lagrangian local biasing models, the cooperative model, and 
the peak model, as simple examples. Although the quantitative correspondence of these 
models and the actual galaxy formation still needs investigation, these examples can give 
qualitative aspects on the nonlocal galaxy formation. 



3.1. Lagrangian Local Biasing Models 

In the Lagrangian local galaxy formation models, the number density of galaxies is a 
local function of a smoothed linear density field. The smoothed density field, however, is a 
nonlocal function of linear density field, 5^. Thus, in some sense, local galaxy formation 
models fall in the category of nonlocal models, with particularly simple functional form: 

Pg (r) = /(Mr)), (3-1) 
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where / is an usual one- variable function of the smoothed linear density field. The smoothed 
linear density field, 5l s , is defined by 

5 Ls (r) = Jd 3 xW^(r - x)8 L {x), (3.2) 

where H/( real ) is a smoothing kernel function with smoothing length s. With this particularly 
simple form, the bias parameter in Fourier space, given by equation ( |2.34j) , reduces to 



M*) = 7T7r4^(M> (3-3) 

where W s is a Fourier transform of the smoothing kernel W^ real ) . All the other higher-order 
kernels K^ (n > 2) vanish. This result is essentially the same which is derived previously 
in real space (Szalay 1988; Coles 1993): 

(r) fWM^^Ym ( , 4) 

since (/(5 Ls )i7i(5 Ls /cr Ls )) = cr Ls (f'(5 Ls )). 

In usual nonlocal biasing models, the smoothing length s is taken to be in nonlinear 
regime. As we consider linear scales, W s (ks) ~ 1 and there is no scale-dependence on 
bp(k) and thus there is no stochasticity in real space. This result is consistent with the 
work by Scherrer & Weinberg (1998), who showed the local models produce constant bias 
factor and there is negligible stochasticity on large scales. Examples of local models are 
density-threshold bias, f(x) oc 9(x — zatls), where 6(x) is a step function (Kaiser 1984; 
Jensen & Szalay 1986), weighted bias, f(x) oc (1 + x)8(x — vcr^s) (Catelan et al. 1994), 
Cen-Ostriker bias, f(x) oc exp{C + C x ln(l + x) + C 2 [ln(l + x)] 2 } (Cen & Ostriker 1992), 
etc. In nonlinear regime, there is really scale-dependence on b-p(k) (Mann, Peacock, & 
Heavens 1998; Narayanan, Berlind, & Weinberg 1998)0. 

Recently, the Mo & White model (Mo & White 1996) for the clustering of dark matter 
halos, which is an extension of Press-Schechter formalism (Press & Schechter 1974; Cole 
& Kaiser 1989; Bond et al. 1991), is interested in (e.g., Catelan et al. 1998; Catelan, 
Matarrese & Porciani 1998; Jing 1998; Sheth & Lemson 1998). This model is also an 
example of the Lagrangian local biasing model, because the halo density contrast <5h ( 1 1 ) in 
their model is determined by linear density field 5q which is smoothed on scale Rq. 



2 Their work is based on numerical simulation and Eulerian local biasing, in which the biasing function is 
a local function of nonlinear density field. Thus, strictly speaking, their analyses are for Lagrangian nonlocal 
biasing scheme in the terminology of this paper. However, the difference between Lagrangian and Eulerian 
correlation functions is negligible (higher-order) on linear scales. 
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3.2. Cooperative Model 

Next, we consider cooperative galaxy formation model introduced by Bower et al. 
(1993), in which they showed that a large-scale (~ 20/i~ 1 Mpc), but weak modulation of 
galaxy luminosities can reconcile the discrepancy between the SCDM power spectrum and 
APM galaxy data (see also Babul & White 1991). As a simple example, they identify sites 
for galaxy formation as places where density contrast 5 satisfies the following relation 

6{r)>va-K§{r), (3.5) 

where 8 is the density contrast smoothed on a scale -R mo d, which represent the large-scale 
modulation, and k is a constant which is called as "the modulation coefficient" . This simple 
model is mathematically equivalent to the standard density-threshold bias model, but for 
the new field defined by 

S'(r)=S(r)+KS(r). (3.6) 

It is easy to see that this new field is just a new density field with a smoothing function 
W s (ks) + KW mod (kR mod ) , where W s and W mod are smoothing function for 5 and S, 
respectively. The bias parameter for the cooperative model in Fourier space is similar to 
equation (|3.3j ): 

= {d9 ^'~ yff)/ f '> [W s (ks) + KW mod (kR mod )] (3.7) 

~ v' 2 /2 

[W s {ks) + KW mod {kR mod )} , (3.8) 



' erfc (i//y/2 

where v' = ucr/J (5' ). This expression and equation ( |2.38|) enable us to evaluate the 
stochastic parameters. In Figure 1, we plot the scale dependence of stochastic parameters 
of the cooperative model. In this Figure, we assume a CDM power spectrum of BBKS 

P(k) = Ak n ( ln(1 2 + 3 4 34g) ) I 1 + 3 - 89q + (16 ' 1?)2 + (5 - 46?)3 + ( 6 - 71 ^) 4 ] ^ ' ( 3 - 9 ) 

k 

r/iMpc - 

with primordial spectral index n — 1 and shape parameter V = 0.5 and linear amplitude 
A = 3.35 x 10 5 (/i -1 Mpc) 4 which corresponds to <Tg = 1. The top-hat smoothing function is 
applied for smoothing R, while the Gaussian smoothing function is applied for smoothing s 
and R mod : 



Q = ^, ,,„„-! , (3-10) 



3 



W(x) = — (sinx - xcosx), W s (x) = W mod (x) = e x ' , (3.11 



x 3 
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and we set v = 2.8, s = 0.5/i -1 Mpc and R mo d = 10, 20, 30/i _1 Mpc. The modulation 
coefficient is adjusted so as to produce the 2.5% rms modulation of the threshold, according 
to Bower et al. (1993), i.e., Kdjva = Au/u = 0.025. This required taking n = 0.89, 2.32, 
4.35 for R mod = 10, 20, 30h~ l Mpc, respectively^. 

As one can see from the Figure, there appears the scale-dependence of bias parameter 
b(R) on scales of -R mo d so that the galaxy clustering on large scales are enhanced by 
cooperative bias. This fact is a main motivation for Bower et al. (1993) to introduce 
the cooperative model. The stochastic parameter r(R) is very close to unity except on 
the modulation scale, where there is weak stochasticity due to scale dependence of bias 
parameter. 



3.3. Peak Model 

Next example of nonlocal galaxy formation is the peak model. In the peak model, the 
sites for galaxy formation are identified as high peaks of initial density field with a fixed 
smoothing length (see BBKS). Treating the constraint properly for density peaks is difficult 
but there are several approximations. In this paper, we approximate the density peaks by 
density extrema (Otto, Politzer & Wise 1986; Cline et al. 1987; Catelan et al. 1988). The 
number density of density extrema above threshold <5t is given by 

p s (r) = 9(S Ls (r) - S t )6 3 (d t 6 Ls (r)) (-l) 3 det (did^r)) , (3.12) 

where 5i, s (r) is a smoothed linear density field with smoothing length s. Density extrema 
are identical to density peaks above some moderate threshold where almost all density 
extrema would be density peaks. 

From the general consideration in the previous section, it is straightforward to obtain 
parameters b(R) and r(R) of stochastic bias for this model. From equation Q2.34J) and after 
tedious calculation^, we obtain 



b F (k) 



a s0 H 2 {i>) 



H 3 (v) + 4^i(^ 2 

7 S 2 



W s {ks), (3.13) 



3 The small differences of our value of k and Bower et al.'s are due to different fitting formula for power 
spectrum. 

4 For interested readers, we refer Appendix E of Matsubara (1995) which contains the essential equations 
to derive the following equation. One can also show that the higher-order kernels K^ n ' of this model are 
finite in large-scale limit. 
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where W s (ks) is a Fourier transform of smoothing function for 5^ s . Other quantities in this 
expression are defined by 



o / rv drv 



J^^P L (k)W s \ks), (3.14) 

u = ^-, s* = V3^, 7s = -^L (3.15) 

O"s0 C"s2 0"s20"sO 

Hermite polynomial if„ are defined with the normalization, 

H n (u) = ^(-±Je-^. (3.16) 

Equation ( |3.13| ) explicitly show the scale-dependence of bias parameter in Fourier space. 
The scale of nonlocality corresponds to s#, which is of order of smoothing length s for 
obtaining density peaks. 

The stochastic parameters b(R), r(R) in linear regime for density peaks are derived 
from the equation (|2.38|) . Assuming v > 1, the result is 



b(R) = -^7—. {^ ~ 3) 2 + 6(i/» - S)^ (-)' + 9^ (-) 4 , (3-17) 
V 1 - 1 (T (T s o ^ 7/ Vc*/ 7 S 4 Vc*/ 

S - 3 + 3^ f x ' 



O"0O"c0 



, 2 _3)^^-3)|(|) 2 + 9|(|; 4 



(3.18) 



where spectral indices of various kind are defined as follows: 

k 2 dk 



< = I -^^P h {k)W s \ks)W\kRl c, 

/ ^k^P L (k)W s (ks)W 2 (kR), d. = v^, 

J 27T Z Oh? 



q I rC QirC 



^cl 


7c = 
7d = 


u cl 


(3.19) 
(3.20) 


> ! 

0"c2 

Cdl 


0"c20"cO 



0"d2 LT d2 CT d0 

Equations ( |3.17| ) and (|3.18|) describe stochastic parameters in real space. 



It is known that if we take both high threshold limit, v — > oo, and large scale 
limit, R — > oo, the correlation function of the peak model reduces to that of linear bias, 
£gg ~ (^/co^mm (Kaiser 1984; BBKS 1986). This property is easily confirmed from 
equation (|3.17| ), where cr c0 = cr<io = Co for R — > oo. 

In Figure 2, these equations are plotted for v = 3.0 with s = 1, 5, lO/i^Mpc, where 



we assume CDM power spectrum of BBKS, equation ( |3.9| ), with primordial spectral index 
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n = 1 and shape parameter T = 0.25 and linear amplitude A = 1.591 x 10 6 (/i -1 Mpc) 4 
which corresponds to erg = 1. The top-hat smoothing function is applied for smoothing R, 
while the Gaussian smoothing function is applied for smoothing s: 

W(x) = -^(sinx - xcosx), WJx) = e~ x2/2 . (3.21) 

x 6 

The large smoothing lengths, s — 5, 10/i -1 Mpc do not correspond to galaxy formation, but 
rather they correspond to cluster formation, since the cluster of galaxies are density peaks 
of large smoothing length. 

As seen in the Figure, once the smoothing scale R exceeds the smoothing length 
s, which is the scale of galaxy or cluster formation in this model, the parameter r of 
stochasticity rapidly converges to unity, which means there is no stochasticity above that 
scale. As stochasticity vanishes, the bias parameter b converges to a constant on large 
scales. 



4. DISCUSSION AND CONCLUSIONS 

In this paper we explicitly derive the stochasticity parameters of the bias in linear 
regime from the nonlocality of the galaxy formation. By using the generalized Wiener- 
Hermite functionals, we can derive the two-point correlation on linear scales which is 
valid even if the galaxy formation process itself is both nonlinear and nonlocal. This is in 
contrast to the usual Taylor expansion which can not treat the strongly nonlinear features 
of galaxy formation. Wiener-Hermite functionals are orthogonal functionals and we do not 
have to assume the smallness of Si, itself, and even do not have to know the exact nonlinear 
evolution of density contrast, S m . Instead, we assume only the smallness of correlation 
function on large scales. 

We show that the stochasticity in Fourier space does not exist in linear regime (except 
for the constant term which comes from the small scale behavior of correlation function), 
and that the biasing parameter in Fourier space bp(k) is given by K^\k). This conclusion 
is true as long as the galaxy formation process satisfies the relation, Kjp(k) ^ 0, and 
higher-order kernels K^ (n > 2) do not increase with scales when k — > 0. This property in 
Fourier space is simply because the galaxy-galaxy and galaxy-mass correlation functions can 
be expressed as convolutions of mass correlation function at lowest order of the expansion 



by £l [equation ( 2.29 )1 - A local model of galaxy formation has a constant bias factor, 
while the nonlocal model has a scale-dependent one, besides mode-mode coupling. In the 
linear regime where mode-mode coupling is negligible, stochasticity in real space comes 
simply from the scale-dependence of the biasing when the galaxy formation is nonlocal 
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and K^(k) 7^ 0. Thus, naive introduction of stochasticity in Fourier mode in the linear 
regime should be avoided. One cannot introduce stochasticity in Fourier mode in the linear 
regime simply because of the lack of knowledge about galaxy formation. If there is any 
stochasticity in Fourier mode in the linear regime, it means that there are exotic process 
in the galaxy formation which does not come solely from the initial density field and such 
process should be correlated on linear scales, unless galaxy formation process has a special 
form to satisfy K^(k) = 0. Such kind of exotic process is not likely, at least in the present 
framework of the standard theory of structure formation in the universe. 

We should note our analyses are restricted to the linear regime. In the nonlinear regime, 
there are mode coupling from both nonlinearity of density evolution and nonlinearity 
of galaxy formation and it makes the Fourier mode stochastic. In the region where 
stochasticity is prominent, the nonlinear density evolution, which is difficult to track 
analytically, is also prominent, so that the phenomeno logical approach of stochastic bias 
should be effectively applied in nonlinear regime (Dekel & Lahav 1998). 

In strongly nonlinear regime, phenomenological approach by the (hyper) extended 
perturbation theory (Colombi et al. 1997; Scoccimarro & Frieman 1998) can shed light 
on how nonlinearity makes cross correlation r deviate from unity. In this theory, the 
higher order cumulants of mass density field is given by (S n ) c = S n cr 2n ~ 2 , where S n 
is a constant predicted by tree-level perturbation theory, and a 2 = (S 2 ). Although 
this theory contains an extrapolation of weakly nonlinear result to strongly nonlinear 
regime, it phenomenologically describe the numerical results. In strongly nonlinear 
regime, a mere averaging and cumulant are approximately equivalent in this ansatz: 
(5 n ) = (8 n ) c + 0(a 2n ~ 4 ) = S n a 2n ~ 2 + 0(a 2n ' 4 ). Thus, if 5 g = 5 n - (<5 n ), one can 
obtain (<$*) = S 2n a 4n ~ 2 + 0{a 4n ~ 4 ) and (8 g S) = S n+1 a 2n + 0(a 2n ~ 2 ). Finally, one has 
r = S n+ \/ \fS2n. This value of r depends on the scale, and departs significantly from unity 
on small scales. 

The conclusion that the stochasticity is weak on linear scales is good news for 
determining the redshift distortion parameter (3 = Q os /b on linear scales from a redshift 
survey. The linear redshift distortion of power spectrum is given by, in the plane-parallel 
limit (Kaiser 1987; Pen 1998), 



P«(*) 



„^°- 6 2 




PM, (4.1) 



where P( s \k) is redshift-space power spectrum of galaxies, and // is a direction cosine of 
the angle between the wave vector k and the line of sight [see Hamilton (1992) for an 
expression for two-point correlation function and Szalay, Matsubara & Landy (1998) for 
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its generalization to non-plane-parallel case]. Since we see r(k) = 1 on linear scales except 
some special cases, we do not need to fit r from the observation when we use only the linear 
redshift distortion. However, the previous analyses so far usually assume the bias parameter 
b as a scale-independent constant. This assumption is justified if the scale of nonlocality of 
galaxy formation is actually below the linear scale. If it is not, the scale-dependence of b(k) 
should also be determined by observation (or by theories, if possible). 

The forthcoming large-scale redshift surveys will reveal the galaxy distribution 
especially on linear scales, on which we have not had sufficient data so far. As shown in 
this paper, the linear clustering properties are analytically tractable even when the galaxy 
formation itself is a too complex phenomenon to analytically track. The exploration of 
linear-scale galaxy distribution can overcome our ignorance of detailed galaxy formation 
processes, and will give a great insight on the primordial features of our universe. 

We thank the anonymous referee for much detailed comments to improve the original 
manuscript. This work was supported in part by JSPS Postdoctoral Fellowships for 
Research Abroad. 
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Fig. 1. — The stochastic bias parameters b(R) and r(R) for the cooperative model are 
shown. Underlying density fluctuation is assumed by CDM model with shape parameter 
T = 0.5 and amplitude <x s — 1. Gaussian smoothing scale and the threshold are fixed as 
s = 0.5/i~ 1 Mpc, v = 2.8, while the three different modulation scale R mo d = 10/i _1 Mpc (solid 
line), 20/i~ 1 Mpc (dashed line), 30/i~ 1 Mpc (long-dashed line) are plotted. The modulation 
coefficient is adjusted so as to give a 2.5% rms modulation of the threshold: k = 0.89, 2.32, 
and 4.35, respectively. 
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Fig. 2. — The stochastic bias parameters b(R) and r(R) for a peak model are shown. 
Gaussian smoothing scale s which defines density peaks are varied as s = 1, 5, lO/i^Mpc, 
which are displayed by solid lines, short dashed lines, and long dashed lines, respectively. 
The threshold for density peaks is fixed as v = 3.0. 



